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A pandemic risk assessment of Middle East respiratory syndrome coronavirus 
(MERS-CoV) in Saudi Arabia. 


Abstract 


Since the initial emergence of Middle East respiratory syndrome coronavirus (MERS- 
CoV) in 2012, a high incidence rate has been observed in Saudi Arabia. This suggests 
that the country is at continuous risk. The epidemic level of MERS-CoV infection was 
examined in Saudi Arabia by the Susceptible-Infectious-Recovered (SIR) model using a 
Bayesian approach for estimation of time dependent reproduction number (R) across a 
two-year interval (May, 2013-May, 2015) in five defined clusters, followed by sensitivity 
analysis of the most significant clusters. Significant MERS-CoV peaks were detected in 
the period between March and May of each year. Moreover, MERS-CoV infection was 
highlighted in western (40.8%) and central (31.9%) regions, followed by eastern region 
(20%). The temporal-based Bayesian approach indicated a sub-critical epidemic in all 
regions in the baseline scenario (R: 0.85 to 0.97). However, R potential limit was 
exceeded in the sensitivity analysis scenario in only central and western regions (R: 1.08 

i jWe 


to 1.12) that denoted epidemic level in those regions. The impact of sporadic cases was 


A 



found relatively insignificant and pinpointed to the lack of zoonotic influence on MERS- 
CoV transmission dynamics. The results of current study would be helpful for evaluation 
of future progression of MERS-CoV infections, better understanding and control 



interventions. 
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Assessment of the Middle East respiratory syndrome coronavirus (MERS-CoV) 
pandemic risk in Saudi Arabia 


Abstract 


Since the initial emergence of Middle East respiratory syndrome coronavirus (MERS- 
CoV) in 2012, a high incidence rate has been observed in Saudi Arabia. This suggests 
that the country is at continuous risk. The epidemic level of MERS-CoV infection was 
examined in Saudi Arabia by the Susceptible-Infectious-Recovered (SIR) model using a 
Bayesian approach for estimation of time dependent reproduction number (R) across a 


two-year interval (May, 2013-May, 2015) in five defined clusters, followed by sensitivity 



analysis of the most significant clusters. Significant MERS-CoV peaks were detected in 
the period between March and May of each year. Moreover, MERS-CoV infection was 
highlighted in western (40.8%) and central (31.9%) regions, followed by eastern region 
(20%). The temporal-based Bayesian approach indicated a sub-critical epidemic in all 
regions in the baseline scenario (R: 0.85 to 0.97). However, R potential limit was 
exceeded the sJ^iv.ty a„a,ysis scenario in only centra, and western regtons (R: ,.08 
to 1.12) that denoted epidemic level in those regions. The impact of sporadic cases was 
found relatively insignificant and pinpointed to the lack of zoonotic influence on MERS- 
CoV transmission dynamics. The results of current study would be helpful for evaluation 



of future progression of MERS-CoV infections, better understanding and control 
interventions. 
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MERS-CoV; time dependent reproduction number; sensitivity analysis; outbreaks; 
sporadic cases 


Introduction 

Middle East respiratory syndrome (MERS) has become a global concern since it was 
recognized for the first time in 2012 as a primitive source of renal failure and severe 
respiratory sickness caused by a novel highly pathogenic coronavirus (Majumder et al., 
2014). Coronaviruses were not considered as a major public health threat before 2003, as 
they were known to cause only mild upper respiratory tract infections. The first known 
serious coronavirus infection was the severe acute respiratory syndrome coronavirus 
(SARS-CoV) in 2002, which caused an outbreak with approximately 8400 cases and 800 
deaths (World Health Organization, 2003). 

MERS-CoV pinpointed a zoonotic introduction of a novel coronavirus probably 
originating from bats into human populations (Sharif-Yakan and Kanj, 2014). The 
zoonotic origin of MERS-CoV was bolstered via phylogenetic analysis and elucidated a 
very proximal phylogenetic similarity with the bat Betacoronaviruses: BtCoV-HKU4 and 
BtCoV-HKU5 (van Boheemen et al. 2012) as well as identified of the cellular receptor 
(Raj et al., 2013). However, contact frequency between human and bats is highly limited 
in Arabian Peninsula. Subsequently other intermediate hosts were proposed, such as 
camels and goats (Raj et al., 2014). MERS-CoV was found circulating in dromedary 
camels from last 20 years (Corman et al., 2014), and MERS-CoV neutralizing antibodies 
were detected in camels (Reusken et al., 2013; Milne-Price et al., 2014). These findings 
indicated that camels could serve as the intermediate host for MERS-CoV zoonotic 
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infections in Saudi Arabia, Oman, Jordan, and United Arab Emirates. On the other hand, 
human-to-human transmission was frequently observed among MERS cases as most of 
the human infections were recorded among health care workers and within households 
proposed a close contact transmission (Hunter et al., 2016; Sharif-Yakan and Kanj, 

2014) . The World Health Organization (WHO) has reported 1038 MERS cases, mostly 
occurred in Arabian Peninsula involved 460 deaths from Saudi Arabia (Memish et al., 

2015) . Consequently, Saudi Arabia was considered as the epicenter of MERS-CoV 
infections (Alqahtani, et al. 2017). Subsequently several studies investigated the existing 
status and future evolution of pathogen to confront the probable outbreaks by studying 
MERS-CoV transmission (Breban et al., 2013; Poletto et al., 2014; Kucharski and 
Althaus, 2015; Abolfotouh et al., 2017). The current study examined the level of 
heterogeneity in MERS-CoV transmission via cluster analysis relying on the 
geographical distribution of MERS cases since it could clarify the linkage between 
epidemic status and geographical separation of clusters (Cauchemez et al., 2016). One 
useful indicator that is used to check viral transmissibility is the basic reproduction 
number (Ro) representing the number of secondary cases due to each index case in a fully 
susceptible population because no approved vaccine is available (Breban et al., 2013). 
Basic reproduction number sets the infectious agent’s potential to start an outbreak. 
When Ro is >1 epidemics takes off and the epidemic can diminish and die out when Ro is 
less than 1 (Chang, 2016). Moreover, reproduction numbers can be estimated at various 
times during an epidemic. It can be estimated at the beginning of an outbreak (initial 
reproduction number) or at any time during the outbreak (time-dependent reproduction 
number). Several methods are used to evaluate the initial reproduction number involving 
attack rate analysis, exponential growth (EG) method, maximum likelihood (ML) 
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estimation and sequential Bayesian (SB) method (Obadia et al., 2012). However, the 
time-dependent reproduction number can be computed by averaging over all transmission 
networks compatible with observations using the time-dependant method (Wallinga and 
Teunis, 2004). The attack rate method requires the least information, but it can be used 
only when the epidemic has finished, and additionally no further intervention can be 
conducted during the entire outbreak course. Thus, the use of this method is mostly 
limited to definite settings as schools or hospitals. In EG method, the exponential growth 
rate occurrence during the early phase of an outbreak can be concomitant to the initial 
reproduction ratio. The incidence data is integer estimated and Poisson regression is 
specified to evaluate this parameter rather than linear regression of the logged incidence 
(Boelle et al., 2009; Hens et al. 2011). ML method depends on the postulation that the 
number of secondary cases triggered by an index case is Poisson distributed with 
probable value R. In sequential bayesian method, the former distribution for R utilized on 
each new day is the subsequent distribution from the previous day. At each time, the 
mode of the posterior may be estimated along with the highest probability density 
interval. Similar to the previous methods, this method necessitates that the epidemic is 
occurring in a period of exponential growth (Obadia et al., 2012). On the other hand, 
overestimation of the initial reproduction number may be obtained in a case when 
epidemic is not observed from the initial case, because some index cases are not present 
in the epidemic curve. In ML method this problem was solved by a correction made for 
absent generations at the beginning of the epidemic curve and similar is recommended in 
the Bayesian setting (McBryde et al., 2008) by assuming a constant reproduction number. 
However, it is not possible for the TD method due to variation of reproduction ratio with 
the time. Thus, the current study relied on Bayesian method to follow up the epidemic 
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progress in Saudi Arabia and particularly in the absence of information around the index 
cases from two year data of MERS cases by assuming a random mixing in a fully 
susceptible population. 


Method 

Data sources 





,v 



The analysis focused on the progress of MERS-CoV in Saudi Arabia from May 2013 
and May 2015. A line list of MERS cases was compiled from the Saudi Arabian ministry 
of health (MOH) reports, published on the official MOH website 
( http://www.moh.gov.sa/en/CCC/PressReleases/Pages/default.aspx) . The list contained 
765 confirmed MERS cases. 

Data clustering 



The concept of cluster was 



geographical grouping into five major clusters: 


central area, northern area, western area, southern area, and eastern area. The provinces 
included in each cluster are shown in Table 1. 



MERS-CoV epidemiological parameters 


The incubation period was fixed at 5.2 days according to data acquired from hospital 
outbreak in Saudi Arabia (Assiri et al., 2013), and in line with information obtained from 
travel-related cases (Cauchemez et al., 2014). 


1. Estimation of the initial reproduction number using sequential bayesian method. 


All known data was used as a prior for next iteration. It relied on an approximation to the 
SIR model, whereby incidence at time t+ 1, N(t+ 1) was approximately Poisson 
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distributed with mean N(t)e 


(y(R- 1)) , -1 


(y represents the average duration of the infectious 


period) (Boelle et al., 2009). 

The formulation of probabilistic standard SIR disease transmission models is similar to 
the time-series SIR approach (Bjprnstad, et al. 2002), where all infectious and susceptible 
individuals are assumed to mix homogeneously and it simplifies to reconstruct the 
transmission chains. The standard epidemic susceptible infected model is expressed as: 

— = -/?-/ and - = /?-/- yl 

dt 'W dt ^ N ' 

S is the average number of susceptibles at time t, I is the average number of infections, N 
is the size of the population, which decreases due to disease-induced deaths, p is the 



contact rate. After an average residence time y , infectious individuals recover or die. 


The Bayesian procedure is engaged to convert time series of case numbers to a 
probability distribution. The proposed algorithm, described in a Bayesian framework, 
started with a non-informative prior on the distribution of the reproduction number R (fig. 
1 ). 

The distribution was updated as new data was observed, using the following 
equation: P(R/N 0 ,....,Nt)= P( A *+ 11R ^ Na ''' 1 ' Ni> (Obadia et al., 2012). An exponential 

growth for the epidemic period was assumed for this method. Moreover, this method 
assumed random mixing in the population. The package “Estimation of Ro and Real- 



Time Reproduction Number from Epidemics” in R software Version 1.6 was used to 
implement the Bayesian method. 


2. Sensitivity analysis 
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A sensitivity analysis was carried out to determine the impact of sporadic cases on the 
basic reproductive number in a specific cluster. Sensitivity analysis was performed by 
screening of the confirmed MERS-CoV cases that were found correlated to the index 
cases. Moreover, the sporadic cases were excluded in sensitivity analysis to estimate the 
impact of these cases on the spread pattern in the baseline scenario. The program code on 
the software R was adjusted by specifying the disease generation time. These adjustments 
allowed the computation of the basic reproductive number after a variation in the time of 
the initial emergence of the virus with transmission to humans (Poletto et al., 2014). The 
R results of the sensitivity analysis were compared to their corresponding baseline R 
results for each cluster (fig. 1). In current sensitivity analysis, we referred to the total 
count of cases involved in the study from May 2013 to May 2015 as the baseline. On the 
other hand, the cases in the outbreak periods were referred as outbreak cases. 

Sensitivity analysis was performed by considering: (i) Detailed definition of source 
region to provinces and belonging cities, including Riyadh, Qasim, Makkah, Madinah, 
Baha, Eastern Province, Tabuk, Jouf, Hail, Northern border, Asir, Najran and Jizan. (ii) 
Restricted five regions were used for data assembly and cluster-based interpretation and 
only the significant clusters data was utilized for sensitivity analysis, (iii) A variation in 
the time of the initial emergence of the virus with transmission to humans was considered 
maximum up to 5.2 days for inter-human transmission and it was s assumed that sporadic 
cases may introduced for a period of more than 5.2 days before the known initial cases, 
(iv) Daily sporadic cases detected in the baseline scenario during the period of 2 year 
study were excluded in the sensitivity analysis to test the possibility of inter-human 
transmission. 


Results 
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The epidemic curve of MERS-CoV infection in Saudi Arabia during the two-year-study 
period displayed a higher dynamic outbreak of MERS cases in the period between March 
and May of each year. Despite the highly significant MERS peak observed particularly in 
2014, the following year showed a lower number of MERS incidents (Figure 2). 


Cluster analysis based on geographical distribution of MERS cases depicted a higher 
representation of reported cases in the western (40.8 %) and central (31.9 %) regions. The 
lower number of cases was observed in the eastern region (20 %). However, the least 
significant cases were recorded in northern (4.6 %) and southern (2.7 %) regions (Figure 
3). The daily rate of sporadic cases was found the highest in central region (1.20, 95% Cl: 
1.07-1.41) in relation to western and eastern regions (0.08, 95% Cl: 0.06-0.12 and 0.05, 


95% Cl: 0.02-0.09) respectively. 


A temporal analysis of reproduction number (R) led to potential variation in R value 
among study clusters. Both central and western regions scored the highest R value (R: 
0.97, 95% Cl: 0.64-1.36) and (R:0.96, 95% Cl: 0.52-1.25) respectively followed by the 



eastern region (R: 0.85, 95% Cl: 0.57-1.05). R value was found at lowest level in the 


Northern and 



n regions respectively (R: 0.02 and 0.18). Therefore, the sensitivity 


analysis was performed for only western, central and eastern regions (Table 2). 


Sensitivity analysis demonstrated an insignificant impact of sporadic cases on the 
dynamic status of MERS CoV spread pattern. It was detected that the model used for the 
analysis was robust and that most of the cases were related and most likely evolved from 
index cases. 


The elevation of basic reproduction number (>1) was recorded in both central and 
western region during the most of the study period (fig. 4.1.b and 4.2.d) and after 
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exclusion of sporadic cases a significant decrease in the confidence interval was noted in 
comparison to the baseline scenario respectively (fig. 4.1.a and 4.1.c). However, the 
MERS-CoV status remained roughly steady below the threshold limit (R <1) even after 
exclusion of sporadic cases in the eastern region with a limited decrease in the confidence 
interval (fig. 4.2.e and 4.2.f). 

Discussion 

Results of the current Real Time Bayesian SIR model suggest a subcritical MERS-CoV 
epidemic in Saudi Arabia, as estimated by the reproductive number to be less than one. 
Subsequently, a self-sustaining epidemic cannot be established in humans, which would 



agree with the findings of other studies (Breban et al., 2013; Poletto et al., 2014). The 
potential outbreak of MERS-CoV in the period between March to May 2014 could be 
interpreted by the paucity of data about the index cases and subsequent waves of 
infections. Moreover, other possibilities must be included such as as seasonal variations 
and their correlation with zoonotic origins like infections in camels (Sharif-Yakan and 


Kanj, 2014). However, the low rate of daily, sporadic MERS cases diminished the 



possibility of ^zoonotic infection and indicated a more likelihood of inter-human 
transmissibnity that is plausible with data reported elsewhere (Breban et al., 2013). The 
limited human to human transmission has been reported due to the variations of MERS- 
CoV receptors in the human upper and lower respiratory tract (Raj et al., 2014) 



Furthermore, a potentially low respiratory disease could evolve in patients lacking 
considerable co-morbidities (Raj et al., 2014). 


The highest incidence of MERS-CoV cases was detected in western and central region 
followed by eastern regions. MERS incidence was found insignificant in both northern 
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and southern regions. Several factors like contact frequency, virus shedding, hospital 
procedures, population composition and density (Drosten et al., 2015) may lead to the 
higher incidence of cases in different regions. Therefore, the higher population density in 
the regions would be considered as a reason for high MERS-CoV incidence. 

The Cl calculated for the reproductive number was found varying considerably across 
the studied period in the regions undergone sensitivity analysis, beginning with a wider 
Cl and ending with a significantly narrower CL Reduced Cl in terms of time indicated 
more related secondary cases rather than sporadic case incidences. The current data 
suggested the less likelihood of external sources as zoonotic intermediates to participate 
in MERS-CoV spread dynamics. Current finding contradicted the results of Paletto, et al. 
(2014) who stated that the Cl for the estimated R was steady along the variations in the 
data interpretation. The discrepancy might be owing to the fact that the current study was 
restricted to MERS-CoV cases in Saudi Arabia rather than the Middle East region. 
Moreover, the variation of the baseline scenario estimates (up to R=0.97) with sensitivity 
analysis values (up to R=1.12) were interpreted by the insignificant contribution of 
environmental or zoonotic transmissions to the MERS-CoV outbreak. However, the 
impact of sporadic cases on the epidemic level of MERS-CoV over the threshold limit (R 
>1) was observed in both central and western regions. The finding of current study 
supported the data from previous study that animal contact was an uncommon cause for 
MERS-CoV in the detected cases (Memish et al., 2015). Some studies reported mild 
MERS cases and other reported a broad spectrum of clinical disease (Assiri et al., 2013; 
Health Protection Agency (HPA) UK Novel Coronavirus Investigation team, 2013). The 
present study might have a selection bias towards symptomatic cases or 
mild/asymptomatic cases that might lead to different spread patterns of MERS-CoV. 
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Screening programs for earlier detection of MERS-CoV silent infection could relate the 
sporadic cases to their index cases, which could affect directly the estimation of the 
probable waves of MERS-CoV infection (Widagdo et al., (2017). Moreover, it could aid 
in mapping of the primary cases destination as well as the high MERS-CoV incidence 
regions. Hence, traceability of MERS-CoV could be enhanced that might lead to support 
the convenient control measures. The second possibility is that these asymptomatic 
patients, if observed, could reveal the factors other than inter-human transmission that 
affect the MERS-CoV spread pattern, some previous studies reported dromedary camels 
as a potential intermediate host for MERS-CoV (Haagmans et al., 2014; Raj et al., 2014). 

The present study reported the highest R values (table 2) ever observed for MERS-CoV, 
however other studies reported R of up to 0.73 (Kucharski and Edmunds, 2015), 0.69 
(Breban et al., 2013; Poletto et al., 2014) and 0.63 (Cauchemez et al., 2014). The 
probable reasons for the surge of R value in the current study could be the absence of a 
vaccine or a treatment, besides the inefficient control measures in health care centers and 
hospitals as most of the observed cases were documented to be inter- and intra-hospital 
transmissions (Drosten et al., 2015; Oboho et al., 2015). Furthermore, Cauchemez, et al. 
(2014) mentioned that Ro could range from 0.8 to 1.3 in the absence of the infection 
control measures. The results of current study are in agreement with the findings of 
Cauchemez et al. (2014). A study in South Korea (Kucharski and Althaus, 2015) 
highlighted the risk of super-spreading events of MERS-CoV infection with relatively 
low basic reproductive number (R=0.47) and should be considered as warning for future 
outbreak events in Saudi Arabia with the prediction of relatively higher R value. 
Therefore, future tracking of infections would add value to our understanding of viral 
transmissibility pattern as well as contracting the CIs around the Ro value. Appropriate 


12 




ACCEPTED MANUSCRIPT 


monitoring of cases as well as enhanced traceability procedures are important to reduce 
transmission rate, diminish any possible opportunity of viral adaptation from human-to- 
human transmission and to obtain reliable data for periodical update of the R value (Min 


et al., 2016). 
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igure 1. Schematic diagram for the assessment of MERS-CoV pandemic risk in Saudi 
Arabia. 
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Figure 2. Epidemic curve of MERS-CoV infection in Saudi Arabia during the period 
between May, 2013 and May 2015 
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Figure 3. Geographical distribution of MERS-CoV cases in the five clusters in Saudi 
Arabia. 
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Figure 4.1. Bayesian skyline plot (BSP) showing the changes in basic reproductive 
number of MERS-CoV across time in a: central region baseline, b: Confirmed cases in 
central region, c: Western region baseline. The dashed black line indicates the median of 
R values estimated from the Poisson regression model. The gray shading indicates the 
95% Cl of the estimated R. 
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Figure 4.2. Bayesian skyline plot (BSP) showing the changes in basic reproductive 
number of MERS-CoV across time in d: Confirmed cases in Western region, e: Eastern 
region baseline and f: Confirmed cases in Eastern region. The dashed black line indicates 
the median of R values estimated from the Poisson regression model. The gray shading 
indicates the 95% Cl of the estimated R. 
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Table 1. Geographical clustering 


Region 

Provinces 


Central region 

Western region 

Eastern region 

Northern region 

Southern region 

Riyadh, Qasim 

Makkah, Madinah, Baha 

Eastern Province 

Tabuk, Jouf,, Hail, Northern border 

Asir, Najran, Jizan 




Table 2. The initial reproductive number values R and the daily rate of emergence of 
sporadic cases of MERS CoV. 

Analysis Data 

R (95% Cl) 

Daily Rate of 
Sporadic Cases 
(95% Cl) 

Baseline Baseline data: All reported cases by 

MOH in Saudi Arabia during the period 

May, 2013 to May, 2015 in only three 


regions: 

a) Western region 

b) Central region 

c) Eastern regio 

> 





j.1 

o 


0.96 (0.52-1.25) 
0.97 (0.64-1.36) 
0.85 (0.57-1.05) 


1.20(1.07-1.41) 
0.08 (0.06-0.12) 
0.05(0.02-0.09) 


Sensitivity 



All confirmed cases in only three 
regions of Saudi Arabia during the 
period May, 2013 to May, 2015 within 
IRS CoV incubation period of 5.2 
ind excluding the sporadic cases: 

a) Western region 

b) Central region 

c) Eastern region 


R: Initial reproduction number, Cl: Confidence Interval 



1.08 (0.84-1.20) 
1.12(0.94-1.47) 

0.97 (0.81-1.18) 
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